Skip to main content
Research theme

Data Science Research Centre

Tampere University
Area of focusTechnology
Area of focusEcological innovations and social challenges

The research activities of the Data Science Research Centre are structured around four core themes. 

Fundamental Machine Learning:

Fundamental Machine Learning focuses on foundational questions in modern machine learning. We study both theoretical and experimental approaches that advance our understanding of how and why machines (can) learn, and propose methodological improvements leading to state-of-the-art solutions in applications of Artificial Intelligence. Our research areas include deep neural network design and training, generative modeling, neural architecture search, statistical (probabilistic) machine learning and inference, just-in-time and distributed inference, learning from multiple data sources, and nonlinear dimensionality reduction. Recent application areas of our research are machine and robot perception, financial data analysis, biomedical signal analysis, industrial data analysis, information retrieval, video coding and IoT.

Keywords: Deep Learning, Statistical Machine Learning, Generative Modeling, Multimodal Learning, Artificial Intelligence 

Team: Alexandros Iosifidis, Moncef Gabbouj, Tarmo Lipping, Jaakko Peltonen, Henri Pesonen, Ari Visa

 

Explainable & Fair Artificial Intelligence:

Explainable and Fair Artificial Intelligence addresses multiple perspectives of equitable and trustworthy artificial intelligence. One research line investigates the mathematical properties of neural models to enhance their interpretability. Another explores logical models that replicate the behavior of machine learning systems, providing rule-based representations of complex model reasoning. A further direction examines how rules can be learned directly from data and applied in real AI applications. In the area of Fair AI, the research focuses on developing fair recommender systems and retrieval-augmented generation (RAG) approaches that ensure equitable treatment of consumers and providers, as well as methods for fair visualization of complex data. Moreover, we tackle the challenge of fairness-aware entity resolution in dynamic environments while advancing explainability in these contexts.

Keywords: Explainable AI, Fair AI, Responsible AI, Transparency, Algorithmic Fairness, Fair Machine Learning

Team: Moncef Gabbouj, Gerardo Iniguez Gonzalez, Jyrki Nummenmaa, Jaakko Peltonen, Kostas Stefanidis, Ari Visa

 

Complex Systems & Network Science:

Complex systems are large collections of components – locally interacting with each other and their environment at small scales – that self-organise into global structures and behaviours at larger scales, even without central control. Components in complex systems form networks of interactions (neurons in the brain, computers in the Internet, humans in relationships) that make it difficult to study them in isolation or to accurately predict their future, due to nonlinear or chaotic dynamics. These interactions can be heterogeneous, dynamical, multilayered or interdependent, and lead to emergent behaviour at the system level. Complex networked systems also adapt to their environment at multiple scales, via cognitive, social, and evolutionary mechanisms, leading to robustness (or fragility) to external perturbations. The field requires a cross-disciplinary approach mixing physics, biology, ecology, social sciences, finance, business, management, politics, psychology, anthropology, medicine, engineering, information technology, and more, since complex systems in different domains often show universal features captured by the same mathematical and computational models.

Keywords: Networks of Interactions, Nonlinear Dynamics, Self-organization and Emergent Behaviour, Mathematical and Computational Modelling

Team: Frank Emmert-Streib, Gerardo Iniguez Gonzalez, Henri Hansen, Juho Kanniainen

 

Applied Machine Learning & Statistical Modelling:

Applied Machine Learning and Statistical Modelling focuses on developing and applying intelligent data-driven methods that bridge modern Artificial Intelligence and statistical reasoning to solve real-world problems across science, engineering, and industry. Methodologically, our research centers on deep learning, generative modelling, probabilistic machine learning, Bayesian methods, likelihood-free inference, nonlinear dimensionality reduction, and visualization of high-dimensional data. We study and advance methods in predictive modelling, time series analysis, graph-based learning, recommender systems, natural language processing and text analysis, exploratory data analysis, and data-driven decision making. Application areas of our research include healthcare and biomedical data analysis, financial modelling, industrial process optimization, social and behavioral data analysis, and smart systems.

Keywords: Time Series Analysis, Graphs, Natural Language Processing, Text Analysis, Likelihood-free Inference, Recommender Systems, Exploratory Data Analysis 

Team: Frank Emmert-Streib, Moncef Gabbouj, Gerardo Iniguez Gonzalez, Henri Hansen, Alexandros Iosifidis, Juho Kanniainen, Tarmo Lipping, Jaakko Peltonen, Henri Pesonen, Ari Visa, Arto Luoma, Jarkko Isotalo, Hyon-Jung Kim-Ollila

Computational Intelligence group (CoIn)

Professor Alexandros Iosifidis is leading Computational Intelligence Group. Its research focuses on designing, analyzing, understanding, and applying Machine Learning approaches in problems coming from Computer/Robot Vision and Perception, Finance, and graph analysis.

Signal Analysis and Machine Intelligence

In the field of Machine Learning, Prof. Gabbouj’s Signal Analysis and Machine Intelligence Group introduced a paradigm shift in ANN by extending the linear operation part of the perceptron to an arbitrary nonlinear function. In this way, Multilayer Perceptrons (MLP) were upgraded to Generalized Operational Perceptrons, and Convolutional Neural Networks (CNN) were extended to Operational Neural Networks (ONN).

Financial Computing and Data Analytics

Prof. Kanniainen is leading Financial Computing and Data Analytics group. His group, in collaboration with Profs Iosifidis and Gabbouj, has developed interpretable Machine Learning methods for Time-Series Modelling. The model is called Temporal Attention-Augmented Bilinear Network, which is highly interpretable, given its ability to highlight the importance and contribution of each temporal instance, thus allowing further analysis on the time instances of interest. Moreover, the group has developed network methods to model information cascades with partial observations on individuals’ states, which are applied for stock markets. These methods can be used to study how social relations drive investors in their decision making and to identify abuse of information in stock markets.

Natural Language Processing

Our Natural Language Processing group, led by Prof. Nummenmaa, has developed novel rule-based and machine learning approaches for question answering, to retrieve answers to natural language queries from big data and knowledge bases. The group has also developed methods for managing and analyzing grammatically parsed data and worked on different text mining tasks, such as frequent pattern mining and distinguishing pattern mining, including sequence mining for textual representations, suitable for mining biological data, represented as text. 

Recommender Systems

Recommender Systems tend to anticipate user needs by automatically suggesting the information which is most appropriate to the users and their current context. Prof. Stefanidis' Recommender Systems Group focuses on algorithmic approaches for traditional and more sophisticated scenarios, like group and sequential recommendations, developing and applying machine learning solutions, and building on both numerical ratings and textual reviews. Moreover, the group studies the big data integration and entity resolution problem for highly heterogeneous data, with a recent focus on progressive solutions for entity matching. 

Statistical Machine Learning and Exploratory Data Analysis

Prof. Peltonen’s Statistical Machine Learning and Exploratory Data Analysis Group focuses on designing and developing Statistical Machine Learning solutions for modeling and exploring data. This includes novel methods for modeling text and matrix data with topic modeling approaches, vectorial embedding approaches generalizing word embeddings, and novel matrix factorization solutions. His group also works on methods for information retrieval from large databases, including modeling and elicitation of user intent by Bayesian regression, probabilistic retrieval, and visualization of user intent.  

Data Analytics and Optimization

Prof. Lipping’s Data Analytics and Optimization group, located at the Pori Campus, develops deep learning and AI solutions for agriculture, health, and industry.

Tampere Complexity Lab

The Tampere Complexity Lab, Prof. Iñiguez’s research group in network science and computational social science, develops computational tools and mathematical theories to understand collective human behaviour by analyzing data and making models of social digital interactions available online. TaCoLAB uses an interdisciplinary, data- and mechanism-driven perspective to study group segregation in social networks, attitudinal polarization online, information diffusion, and the dynamics of ranked and hierarchical complex systems.

Predictive Society and Data Analytics

Prof. Emmert-Streib’s group, Predictive Society and Data Analytics Lab, conducts innovative research in data science with a deep appreciation for statistical thinking. The group studies a wide range of data types, e.g., genomics data, text data and network data by developing and applying methods from machine learning, AI and statistics. Current methodological focus is on learning paradigms, including transfer learning, multi-label classification and the digital twin, and deep learning architectures. Furthermore, the inference and analysis of networks is studied by network science.

Responsible Data Management and Ethical Artificial Intelligence

Profs. Nummenmaa, Peltonen, Elomaa, Stefanidis and Juhola focus as well on Responsible Data Management and Ethical Artificial Intelligence, where a rising concern is how to perform statistical data analysis and machine learning in an ethical, fair, transparent, and explainable manner. In this line of work, we also focus on enabling different stakeholders to query, understand and fix sources of bias in data science solutions, in an accessible and transparent manner. Methods for providing explanations that target at understanding the cause of unfairness and examine the capability to capture user intent that typically changes across sessions are developed.   

Multimedia and Data Mining

Prof Visa's Multimedia and Data Mining Group works with explainable machine learning or artificial intelligence. The main application fields for this technique are time series of hyperspectral signals or images.

Urban Physics Research Group

The way our urban areas are designed influences the amount of energy we consume, our exposure to environmental hazards such as pollution and climate change, and our health. The Urban Physics Research Group uses data science and physics-based models to understand how to design urban environments that are healthy and energy efficient, now and in the future.

Decision Support for Health

The group, led by prof. Mark van Gils, develops data-driven analysis methods to help healthcare professionals and patients get actionable information out of complex health-related data. The groups’ methods are typically based on combinations of biomedical signal processing, (explainable) AI and ML, and statistical analysis. Our methods are designed to work with real-life, suboptimal quality, data, and coming from different modalities. As specific domain examples, we have several decades of expertise in critical care decision making (intelligent patient monitoring, intervention planning) and chronic diseases (risk assessment, recommendation and motivation). Furthermore, we actively contribute to health ICT standardization initiatives.

Applied Statistical Data Analysis

Our Statistics Research is strongly connected with data science but has both its own distinct aspects within data science and its own core research separate of data science: in particular, solutions of tasks are carried out via statistical modeling, analysis of time-dependent data (timeseries, longitudinal), planning of data gathering, treatment of distributional assumptions, representation and management of uncertainty, probabilistic estimation and inference, prediction and hypothesis testing, and the research and theory of these core methodologies is unique to statistics. The Group of Applied Statistical Data Analysis conducts applied statistical research, where statistical methods are used and modified to solve research problems in different disciplines, for example in health, medicine, social sciences and technology.  

AI Hub Tampere

The principle of AI Hub Tampere is to make AI easy to reach and affordable, and thus all our services are free of charge, neutral and equal for all companies active in Pirkanmaa. The AI Hub is part of nationwide network of AI centres that is developing fast. Our aim is to assist local companies in boosting their competitive edge. Our focus has been on sustainable AI, health technology, and energy efficiency. The methods and devices we often work with include collaborative robots and self-driving vehicles.